274

18

Transcriptomics and Proteomics

is practicable up to about 20 cycles and is highly appropriate wherever linear

heteroöligomers sharing a common chemistry are required.

2. For all other cases, minute amounts of the receptor substances are directly

deposited on the array (e.g., using laboratory microrobotics combined with inkjet

technology for applying solutions of the different substances). This is suitable for

large macromolecules, such as proteins, or sets of molecules of substances not

sharing a common chemistry, or longer oligopeptides.

In both cases, each patch is uniquely identified by its Cartesian array coöordinates.

Sample preparation. The raw material is processed to make available the analyte(s)

of interest and possibly partially purified. The mRNA is typically used to generate

a set of complementary DNA molecules (cDNA), which may be tagged (labelled)

with a fluorescent or other kind of label.

Array exposure. The array is flooded with the sample and allowed to reach equi-

librium. Then all unbound sample is washed away. If the analyte was not tagged,

tagging can be carried out now on the chip (e.g., by flooding with a hybridization-

specific dye 6) after removing the unbound molecules, which has the advantage of

eliminating the possibility of the tag interfering with the binding.

Array reading. The array is scanned to determine which patches have captured

molecules from the sample. If the sample molecules have been tagged with a fluo-

rophore, then fluorescent patches indicate binding, with the intensity of fluorescence

giving some indication of the amount of material bound, which, in turn, should be

proportional to the amount of mRNA present in the original sample.

Image processing. The main task is to normalize the fluorescent (or other) intensities.

Normalization is important when comparing the transcriptomes from two samples

(e.g., taken from the same tissue subject to two different growth conditions). A

straightforward way of achieving this is to assume that the total amount of expressed

mRNA is the same in both cases (which may not be warranted, of course) and to divide

the intensity of each individual spot by the sum of all intensities. If the transcriptomes

have been labelled with different fluorophores and exposed simultaneously to the

same chip, then normalization corrects for differences in fluorescence quantum yields

and the like.

Analysis. The procedures followed for supervised hypothesis testing will depend on

the details of the hypothesis (Sect. 13.1). Very commonly, unsupervised exploratory

analysis of the results is carried out which, in effect, uses no prior knowledge but

explores the data on the basis of correlations and similarities. One goal is to find

groups of genes that have correlated expression profiles, 7 from which it might be

inferred that they participate in the same biological process. Another goal is to group

6 For example, ethidium bromide, the fluorescence of which becomes about 20-fold stronger after

it is intercalated into double-stranded DNA.

7 An expression profile is defined as a two-column table, with conditions in the left-hand column

and the corresponding (relative) amounts of expressed proteins (possibly as RNA) in the right-hand

column.